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AIMS OF THE BOOK 

The primary aim of this book is to provide a compendium of the moh 
found at leucocyte surfaces. The book includes entries for 139 mole' 
the entries there are three chapters that provide a perspective for tj 
cell surface molecules. These chapters deal with the dimensions 
found at cell surfaces, with the protein superfamilies into whicl 
molecules can be grouped, and with the linkages of the genes 
molecules. 

A systematic approach to the naming of leucocyte antigens was r 
"cluster of differentiation" designation for human leucocyte antig 
with monoclonal antibodies (rnAbs] 1 . MAbs were submitted to w 
were placed into groups based on fluorescent labelling patterns 
leucocyte populations. In this classification, antigens are given a CI 
we provide in this book entries on all the CD antigens. We also incli 
antigens from any species which are known to be expressed on i 
leucocytes, that have been characterized at the level of their primary 
have not been given a CD number! Molecules that have been seque 
be at leucocyte surfaces but have not been identified immunochemi 
included. 

ORGANIZATION OF THE DATA 
Name 

The CD nomenclature is used wherever possible for both the humai 
species homologues. In some cases the antigens are known by severa 
- and- the most common of these are also given together with comir 
species homologues. The term homologue is reserved for equivalt 
between species and is not used to indicate related molecules in a sup 

Molecular diagram 

This gives a visual representation of the molecule, including 
membrane attachment, the protein domains that are present, and 
glycosylation. Figure 1 lists the symbols that are used to represei 
domains, the N- and O-linked oligosaccharide structures and the disi 
A summary of the domains present in the leucocyte molecules is gr 
which also includes the page numbers of the individual entries. 

For molecules attached to the cell surface by glycosyl-phosphatidy 
anchors, the anchors are shown in the diagrams by an arrow. The c 
molecules with protein transmembrane sequences are shown by 
intracellular terminus. I 

The criteria used for defining the superfamily relationships of pn 
are discussed in Chapter 3. In those cases where no superfamily i 
detected, the extracellular region or domain is shown as a circle 
question mark. The diameter of the circle is proportional to th 
extracellular region or domain. If the protein sequence contains a hi 
of Ser, Thr and Pro residues and is probably heavily O-glycosylated, 
an extended structure to distinguish it from regions likely to I 
conformation. It is not possible to show the positions of all the disul] 
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Figure 1 . Models for domains and repeats found in leucocyte membrane i neins. 
These models are used in the diagrams drawn for the entries in Section 11 
Additional abbreviations used in the domain diagrams are: S f signal seqi nce ; 
TM, transmembrane sequence; CY, cytoplasmic sequence; G, GPI anchoi ignal 
sequence. 
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Table 1. Distribution of domains in leucocyte surface molecules 
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Antigen nam! 

CD36 1 
CD37 | 
CD38 | 
CP39|NS) I 
CD40 ;j 
CD41 5 
CD42a 
CD42ba | 
CD42bB % 
CP43 j 
CD44 | 
CD45 | 
CD46 j 
CD47(NS) jj 
CD48 j 
CD49 $ 
CDw50jNS| 

CD51 | 
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CDS 5 j 
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CD63 jj 
CD64 | 
CDw65 (CH; 
CD66 % 
CD67 (NS) j 
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CDw70 (NSj 
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The data are for single polypeptides although some will form homo- or heterodimers. 
a a or p" chain. 

& The number of predicted passes through the membrane. 

c Whether or not the molecule has a GPI anchor. 

A Whether the molecule is a Type I or II transmembrane protein. \ 

NS Sequence not determined. 

CHO Determinant dependent on carbohydrate. 



the diagrams and these bonds are shown only for Ig-superfamily (IgSF) domains 
since this has been traditionally done in illustrating this superfamily. The majority 
of the cytoplasmic domains do not belong to defined superfamilies and are 
represented by wavy lines whose length is proportional to the sequence length. 

The N-linked glycosylation sites are those predicted from the cDNA sequences 
by the presence of the sequence AsnXaaThr or AsnXaaSer with the exceptions of 
AsnProThr/Ser, which are normally not glycosylated and AsnXaaSer/ThrPro which 
are not often glycosylated 2 ~ 4 . O-linked glycosylation occurs at Ser or Thr amino 
acids but there is no clear-cut sequence motif that invariably indicates this type of 
glycosylation. However, O-linked glycosylation is likely for stretches of sequence 
that include a preponderance' of the amino acids Ser, Thr, Pro 5 ' 6 and this is shown 
in the diagrams for regions where a high level of O-glycosylation is expected. In 
many cases there are biochemical data to support the assignment of O-glycosylation. 

Glycosaminoglycans (GAG) have been identified in two leucocyte membrane 
glycoproteins CD44 and syndecan. The number of these sites has not been 
confirmed biochemically and the symbols indicate only their approximate number 
and positions. The GAG has been characterized, for example/ as chondroitin 
sulphate in CD44 and this is shown in the diagram. 

Size of the processed form of the molecule 

The relative molecular weight (M r ] of the polypeptide backbone is calculated from 
the M t of the mature protein. This is the polypeptide without the signal sequences 
for secretion or attachment of a GPI anchor. In most cases the position of cleavage 
of the signal sequences has been predicted from comparison with consensus 
sequences from cases where cleavage points are known 7 . 
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The M r values obtained from polyacrylamide gel electrophoresis in sodium 
dodecyl sulphate under reducing and/or nonreducing conditions are given. There 
can be considerable variation according to the conditions used for the SDS gels and 
the value is typical of those In the literature. 

The degree of glycosylation 

The number of potential N-linked glycosylation sites in the extracellular portions 
of membrane glycoproteins is shown as determined by the presence of the 
consensus sequence AsrjXaaThr/Ser. To obtain a rough idea of the weight of the 
glycoprotein likely to be accounted for by N-linked oligosaccharides an average 
value of 3000 M r can be used per structure. It should, however, be noted that some 
oligosaccharides can be ( considerably larger due, for example, to the presence of 
. repeating kctosarnine units or complex blood group antigens. 

The number of O-linked sites cannot be predicted directly from the sequence and 
thus the following terms are used to describe the extent of O-glycosylation: 

( 1 ) Unknown: There are no data from sequence or immunochemical 
studies to indicate the level of O-glycosylation. 

(2} Nil: No O-linked oligosaccharides as determined by the absence of 

galactosamine in the composition of the molecule or the values for 
apparent M r on SDS gels do not exceed those accounted for by the 
polypeptide and N-linked oligosaccharides. 

(3) Probable +: Sequence data suggest the likelihood of O-glycosylation. 

(4| + abundant: There are sequence and/or biochemical data to indicate a high level 
of O-glycosylation. 

Gene location and size 

The chromosome location of the human gene from the Genome Data Base (Howard 

Hughes Medical Institute and Johns Hopkins University) or from ref . 8 is given 
together with its approximate size. If the gene structure has been determined in 
another species but not for humans, the gene size for the other species is given. 

Domain and exon organization 

A diagrammatic representation of the positions of domains and exons is given 
where it is informative. These diagrams are not drawn to scale. Domain positions 
are defined by indicating the positions of two key conserved residues within the 
domain. The first residue is shown with two flanking residues on the GOOH- 
terminal side and the second with two flanking residues on the NH2-terminal side. 
The sequences are given in the upper part of the domain diagram and can be used 
to identify the domain in the full sequence given at the^ end of the entry. The key 
conserved residues for each domain type are marked with asterisks in the sequence 
alignments given in Chapter 3 where they can be seen in relation to alignments for 
each superfamily. Internal positions are identified rather than trying to define the 
full domain since in some cases the end of one domain overlaps with the beginning 
of another (e.g. between GD4 domains 1 and 2 9 ). The identification of conserved 
internal residues should readily allow analysis of domains by comparisons with the 
alignments of sequences in Chapter 3. 

Exons are identified by three or four residues of protein sequence in the lower 
part of the diagram with the amino acid that is coded at the splice junction aligned 
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with the intron/exon boundary mark. The type of intron/exon boundary is 
indicated as type 1 (splicing after nucleotide 1 of the triplet), type 2 (splicing after 
nucleotide 2) and type 0 (splicing at the end of the codon) 10 . Thus if the junction 
was type 1 and a Trp residue was shown at the boundary of an exon/intron splice 
site, the point of junction would be after the T of the TGG codon. By comparing 
the intron/exon boundaries and the domain designations it should be clear whether 
a given domain is encoded within one exon. Where exons are known to be 
alternatively spliced a space is included between the rectangles that represent the 
exons; all other exons are drawn contiguously. 

Tissue distribution 

A brief description of the cell types that have been clearly shown to express an 
antigen is given. In many cases full analyses of tissue distributions have not been 
done. A failure to mention a cell type cannot be taken as meaning that it is known 
that the cell type does not express the antigen. The distribution of the CD antigens 
in humans is reviewed in Leukocyte Typing IV 11 . This reference is implied but not 
cited for each of the CD antigens entries. There are often considerable differences 
in the tissue distribution between species and thisHs indicated when data are 
available. A pullout diagram of the distribution of the human CD antigens on 
leucocyte populations is enclosed inside the back cover. 

Structure 

Biochemical and structural data are summarized in this section. 
Function 

Functional data are summarized in this section. 
Comments 

This section is used for any additional key points of interest concerning the 
molecule in question. For example, if the molecule is particularly closely related to 
another protein or alternative splicing of RNA leads to the production of additional 
forms of the molecule then this information is included. In addition motifs 
identified with particular functions are given. 

Database entries 

The PIR (Protein Identification Resource) and SWISSPROT databases for protein 
sequences and the GENBANK and EMBL databases for nucleic acid sequences are 
cited. Every new sequence submitted to the databases is given an accession 
number. These are then incorporated into entries which are given a specific code. 
One problem is that codes can change if, for instance, two entries are merged or 
when sequences become fully annotated. Therefore the only definitive way to 
identify a sequence is through the accession numbers and these are given in the 
entries. Where there are multiple accession numbers the primary number is given. 
In some cases only genomic sequence is available and is spread over several 
database entries. In these cases more than one accession number is given. For data 
recently sent to EMBL and GENBANK both databases use the same accession 
number. 

Most software can identify sequences through the accession number and 
sequences can also be obtained through electronic mail (EMAIL) directly from 
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automatic computers or servers at the database centres using the acces: m 
number. The procedures are shown below. 

Obtaining DNA and protein sequences from the EMBL and SWISSPROT 
databases 

The EMAIL address is: NETSERV@EMBL-HEIDELBERG.DE 

The commands used are: GET to specify the request and NUC or PROT to 

specify the EMBL or SWISSPROT databases, respectively. 

A typical file might look like: 

GET PROT:P0 1830 

GETNUC:X03152 

This file would request the sending of two entries, one from SWISSPROT 
(protein) and one from EMBL (DNA). 

Obtaining piotein sequences from the PIR databases 
The EMAIL address is fileserv@gunbrf.bitnet 

To retrieve an entry the file should incmdethe command GET together with he 
database code (not accession number). To obtain the code from the accession 
number, type the command ACCESSION then the number. A typical file wo d 
look like: 
GET TDRT 
ACCESSION A02107 

This file would request the sending of the entry for TDRT and a listing of th. 
codes that contain the accession number A02.107 

Obtaining DNA sequences from the GENBANK database 
The GENBANK service at retrieve@genbank.bio.net has been replaced by a 
server at National Center of Biotechnology Information (NCBI). The EMAIL 
address is: retrieve@ncbi.nlm.nih.gov 

The f Ue should contain the name of the database, the command BEGIN and ie 

accession numbers or key words. A typical file would look like this: 

DATALIB genbank 

BEGIN. 

J02852 

Note for further information on how to use these servers, send an EMAIL 
message to the servers containing the single cornmand 
HELP . * 

Th^hu^nan amino acid sequence is given if it is known. If a sequence has mly 

been obtained from another species, then this is given. The single letter co. tor 

amino acids is used (defined in Table 1 in Chapter 3). In most cases ' lere 

structural and functional data are discussed it is with respect to seq :nce 

numbering of the fully processed form of the molecule. If the NH 2 - termini has 

not been defined by sequence analysis the signal sequence for secretion is pre. sted 
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from comparisons with sequences where the position of signal cleavage ha 
determined. The consensus rules for defining cleavage sites are reviewed ii 
The signal sequence is shown on a separate line. For molecules likely t 
transmembrane sequences the proposed hydrophobic regions are underlin 
GPI anchors, signal sequences are predicted using criteria discussed in Ch 
and the signal sequence is shown on a separate line below the proposed pr< 
sequence. Variants produced by alternative splicing are difHcult to represi 
where it is clear that these are expressed the sequences of the alternative foi 
indicated. Allelic alternatives of the gene sequence are given where appropn: 

References . 

It is not feasible to give a comprehensive list of references. The references 1 
given are recent ones that should allow access to the rest .of the literature i 
references are highlighted in bold. In some cases these are recent reviews 
others it is the most recent paper conuining relevant references. There ai 
books in this FactsBook series that specialize on cytokines and their recept 
on adhesion molecules l^ 13 . 
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Molecular weights 
Polypeptide 36844 



SDS PAGE 
reduced 
unreduced 



45-58 kD 
45-58 kD 



C2 I 



Carbohydrate 

N-linked sites 3 
O-linked nil 

Human gene location and size 
lpl3 ; <28.5 kb * 



mil < w 
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Tissue distribution 

CD2 is expressed on virtually all T lymphocytes, NK cells and 
thymocytes W. In species other than man, CD2 is found on mouse B cells 
and rat and sheep splenic macrophages 4 . 



Structure 

The extracellular domain is made up of two IgSF domains and is in a subset 
within the IgSF together with its ligand, CD58 2 . The structure of the Non- 
terminal domain has been determined by NMR and shown to haye an IgSF 
V-like fold, but lacks the usually conserved disulphide bonds between the p; 
sheets s . The NH 2 -terminus of the mature polypeptide has been established 
by protein sequencing 1 . The cytoplasmic domain is rich in basic and proline 
residues and is well conserved across species 2 > 4 . The 3' end of the gene is 
necessary and sufficient for tissue-specific expression in transgenic mice *' 6 . 



Function 

CD2 interacts through its NH 2 -teirmnal domain with CD58 to form an 
adhesion pair, for example between T cells and cytotoxic targets or antigen 
presenting cells Activation of T cells by some pairs of CD2 mAbs is 
mediated by the cytoplasmic domain and is dependent on the expression of 
a functional TcR-CD3 complex s-* CD2-mediated signal transduction 
mimics that through the TcR-CD3 complex 2 ~4 7 . Activation of NK cells 
via CD2 is dependent on interaction with CD16 7 



Comments 

The genes for CD2 and CD58 are linked in the genome 2 . 



CD2 



Database accession numbers 

PIR SWISSPROT EMBL/GENBANK REFERENCE 
Human A28967 P06729 M16445 *** 

Rat A33071 P08921 X05H1 

Mouse S02293 P08920 Y00023 
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Amino acid sequence of human CD2 

MSFPCKFVAS FLLI FNVSSK GAVS 

KEITNALETW GALGQDINLD IPSFQMSDDI DDI KWEKTSD 

KETFKEKDTY KLFKNGTLKI KHLKTDDQDI YKVSIYDTKG 

KIQERVSKPK ISWTCINTTL TCEVMNGTOP ELNLYQDGKH 

KWTTSLSAKF KCTAGNKVSK ESSVEPVSCP EKGLDHJJX 

VFlfAILVFYI T KRKKQRSRR NDEELETRAH RVATEERGRK 

NPATSQHPPP PPGHRSQAPS HRPPPPGHRV QHQPQKRPPA 
GPPLPRPRVQ PKPPHGAAEN SLSPSSN 



-1 

KKKIAQFRKE 50 
KNVLEKI FDL ,100 
LKLSQRVITH 150 
GICGGGSLLM 200 
PQQIPASTPQ 250 
PSGTQVHQQK 300 
327 
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' T4, L3T4 (mouse), W3/25 (rat) 



Molecular weights 
Polypeptide 48400 



SDS PAGE 
reduced 
unreduced 



55 kD 
55 kD 
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N-linked sites 2 
Olinked nil 

Human gene location and size 
12pter-pl2 ; 33 kb * 
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Tissue distribution 

CD4 is expressed on most thymocytes and approximately two thirds of 
peripheral blood T cells, which constitute the CD8 negative cells A In 
human and. rat but not in mouse, CD4 is expressed on monocytes and 
macrophages 2 . 

Structure 

The extracellular domain is made up of four IgSF domains. The structures 
of the amino terminal two domains have been determined by X-ray 
crystallography, conftrrrung that they are Ig-like 3 ' 4 . Domain 2 is 
characterized by an unusual disulphide within one beta sheet and domain 3 
lacks a disulphide in the position conserved in most IgSF domains. Cat CD4 
shows some unusual features with 17 residues inserted between domains 1 
and 2 5 . There is an additional Cys in domain 1 and the Cys in the unusual p 
strand C position in domain 2 is replaced with a Trp and there is an extra 
Cys in the p strand F 5 . The position of the NH r terminus has been 
established for the rat homologue 6 . 

Function 

CD4 is an accessory molecule in the recognition of foreign antigens in 
association with MHC Class H antigens by T cells 2 . MAbs against CD4 
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Comments 

CD4 shows part 
protein [see pagi 
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RRQAERMSQI 1 

Database accdj 

Human i 
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Mouse | 




Amino acid s 

MNRGVPFRHL i 
KKVVLGKKGD * 
LNDRAD5RRS \ 
TANSDTHLLQ % 
LQDSGTWTCTJ 
AFTV EKLTGS | 
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inhibit T cell functions in vivo and in vitro 2 . The cytoplasmic domain of 
CD4 is phosphorylated at Ser residues 408, 415, 431 (see below) when T 
cells are activated by antigen or phorbol esters 7 

interacts with a lymphocyte-specific tyrosine kinase called^ p56kk through , a 
motif shown below » CD4 is a receptor for HIV- 1 (AIDS virus) and the 
binding of the viral gpl20 protein is to a region of the ammo terminal 
domain 

Comments 

CD4 shows particularly close similarities in overall structure to the LAG-3 
protein (see page 342). 

Motifs involved in CD4 function 

p56 lck recognition site (underlined) and Ser residues phosphorylated (in bold) 
RRQAERMSQI KRLLSE KKTC Q CPHRFQKTC SPI (433) 

Database accession numbexs^^^ mBL/GEmANK REFERENCB 

Human AO2109 P01730 M12807 * 

Rat A27449 P05540 M15768 6 

Mouse A02110 P06332 M13816 



Amino acid sequence of human CD4 




MNRGVPFRHL 
KKVVLGKKGD 
LNDRAOSRRS 
TANSDTHLLQ 
LQDSGTWTCT 
AFTVEKLTGS 
KKLPLHLTLP 
LTCEVWGPTS 
GQVLLESNIK 
RRQAERMSQI 
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LLVLQLALLP AATQG 
JVELTCTASQ KKSIQFHWKN- 
LWDQGNFPLI IKNLK1EDS0 
GQSLTLTLES PPGSSPSVQC 
VLQNQKKVEF KIDIVVLAFQ 
GELWWQAERA SSSKSWITFO 
QALPQYAGSG NLTLALEAKT 
PKLHLSLKLE NKEAKVSKRE 
VLPTWSTPVQ PMALIVLGGV 



KRLLSEKKTC QCPHRFQKTC SPI 



SNQIKILGNQ 
TYICEVEDQK 
RSPRGKNIQG 
KASSIVYKKE 
LKNKEVSVKR 
GKLHQEVNLV 
KAVWVLNPEA 
AfiM IFIfiLB 



-1 

GSFLTKGPSK- - -50- 
EEVQLLVFGL 100 
GKTLSVSQLE 150 
GEQVEFSFPL 200 
VTQDPKLQMG 250 
VHRATQLQKN 300 
GMWQCLLSDS 350 
IFFCV. RCRHR 400 
433 
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Molecular weights 

Polypeptide 22919 

SDS PAGE 

reduced 40 kD 
unreduced 38 kD 

Carbohydrate 
N-linked sites 2 
O-linked probable + 

Human gene size and location 

17;3kb* 
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Tissue distribution 

CD7 is die earliest marker antigen expressed in the T lineage being found 
on T ceU precursors in fetal liver and thorax prior to thymic colonization 
and m thymus and bone marrow * C D7 is expressed on pluripotential 
haematopoietic cells most human thymocytes and a major subset of 
peripheral blood T cells « CD 7 is a marker for pluripotential stem ceU 
leukaemias and T cell acute lymphocytic leukaemia * 

Structure 

An Ig-like domain is separated from the transmembrane sequence bv 4 
repeats containing a high proportion of Pro, Ser and Thr residues 3 This 
region is likely to be O-glycosylated and to form an extended structure like 
that proposed for the CD8 hinge. The gene for CD7 is similar to the murine 
Thy-1 gene which places it in a class of tissue-specific genes whose 
promoters lack a TATA element 1 . 

Function 

The function of CD7 is unknown and no ligand has yet been identified A 
proposal that CD7 was an IgM receptor was not confirmed in studies on 
expression from cDNA a A role for CD7 in the activation of T cells with v/8 
receptors has been proposed based on mAb-induced activation f 
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PIR 
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SWISSPROT EMBL/GENBANK REFERENCE 
P09564 X06180 3 
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Amino acid sequence of human CD7 

MAGPPRLLLL PLLLALARGL PGA LA 

AQEVQQSPHC TTVPVGASVN ITCSTSGGLR GIYLRQLGPQ PQDI IYYEDG 

VVPTTDRRFR GRIDFSGSQD NLTITMHRLQ LSDTGTYTCQ AITEVNVYGS 

GTLVLVTEEQ SQGWHRCSDA PPRASALPAP PTGSALPDPQ TASALPDPPA 

acaip^iA Y i^nifilfiLfi VACVLA RTQI KKLCSWRDKN SAACVVYEDM 
SHSRCNTLSS PNQYQ 



References 

1 Schanberg, L.E. et al. (1991) Proc. Natl Acad- Sci. USA. 88, 603-607. 

2 Haynes, B.F. et al. (1989) Immunol. Today 10, 87-91. 

3 Aniffo, A. and Seed, B. (1987) EMBO J. 6, 3313-3316. 

4 Carrel, S. et al. (1991) Eur. J. Immunol. 21, 1 195-1200. 



-1 
50 
100 
150 
200 
215 



by 4 
This 
like 
nine 
hose 



d. A 
s on 
i Y /8 



CE 




CD16 



FcyRin 



Molecular weights 



Polypeptide Transmembrane form 27268 
50-80 kD 



SDS PAGE 
reduced 



Carbohydrate 

N-linked sites 5 
O-linked unknown 

Human gene location and size 
lq23 ; 9kb^ 
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Tissue distribution \ 

In humans the transmembrane form of CD16 is expressed on NK cells and 
macrophages, whilst the GPI-linked form is expressed on neutrophils. In the 
mouse no GPI-linked form of CD16 has been identified and the 
transmembrane form is expressed on macrophages, NK cells, neutrophils 
and myeloid precursors [reviewed in ref. 3]. 

Structure 

There are two distinct forms of human CD16 encoded by two linked genes: a 
transmembrane form with a_25 amino.acid cytoplasmic tail and a. glycosyl- 
phosphatidylinositol |GPI)-linked form 1 ^. Their extracellular sequences 
differ by only six amino acids and site directed mutagenesis showed that 
amino acid Serjgg (mature numbering) determines the attachment of the GPI 
anchor The extracellular region of CD16 is formed by two C2-set IgSF 
domains 4 . The transmembrane form is non-covalently associated with the 
FceRly chain 5 or the TcR£ chain *, forming a multimeric complex, and its 
surface expression is dependent on the co-expression of the y or £ chains 5 A It 
has been shown that on mast cells CD16 is also associated with the p chain of 
the FceRI 7 . There are two alleles of the GPI-linked form 3 . Both CD 16 genes 
have identical intron exon boundaries 3 . 

Function 

CD 16 is a low affinity receptor for aggregated IgG 3 . The transmembrane 
form binds IgG complexed to antigens and mediates phagocytosis and 
antibody-dependent cellular cytotoxicity. Signal transduction by CD 16 is 
mediated through the y chain 8 . Cross-linking of CD 16 on NK cells with 
immune complexes or mAbs induces calcium mobilization and hydrolysis 
of membrane phosphoinositides. In contrast the GPI-linked form of 
neutrophils binds tie ligands but is unable to induce any signal or 
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functional effect [reviewed in ref. 3 ]. The engagement of the transmembrane 
form results in the tyrosine phosphorylation of the TcR£ chain 9 . 



Comments 

CD16 (FcyRIII) has structural similarity to CD64 (FC7RIJ, CDw32 (FcyRri) 
and FceRIa 3 . The sequence LFAVDTGL is completely conserved in the 
transmembrane regions of human and mouse CD 16 and human, mouse and 
rat FceRIa 10 . A family of CD 16 isoforms have been described in the rat 10 . 
Patients with paroxysmal nocturnal haemoglobinuria lack the GPI-linked 
form of CD 16 and this is probably responsible for the presence of circulating 
immune complexes and the susceptibility to bacterial infections associated 
with this disease 11 . 

Database accession numbers 

PIR SWISSPROT EMBL/GENBANK REFERENCE 
Human JL0107 P08637 X16863 * 

Mouse P08508 \ M14215 12 

Rat M64368-M64370 ™ 

Amino acid sequence of human CD 16 

Transmembrane form 

MWQLLLPTAL LLLVSA6 

MRTEDLPKAV VFLEPQWYRV LEKDSVTLKC QGAYSPEDNS TQWFHNESLI 

SSQASSYFID AATVDDSGEY RCQTNLSTLS DPVQLEVHIG WLLLQAPRWV 

FKEEDPIHLR CHSWKNTALH KVTYLQNGKG RKYFHHNSDF Y1PKATLKDS 

GSYFCRGLFG SKNVSSETVN ITITQGLAVS TISSFFPPGY QVSFCLVMVL 

LFAVDTGLYF SV KTNIRSST RDWKDHKFKW- RKDPQDK 

GPI-Unked form 
MWQLLLPTAL LLLVSAG 

MRTEDLPKAV VFLEPQWYRV LEKDSVTLKC QGAYSPEDNS TQWFHNESLI 

SSQASSYFID AATVDDSGEY RCQTNLSTLS DPVQLEVHIG WLLLQAPRWV 

FKEEDPIHLR CHSWKNTALH KVTYLQNGKG RKYFHHNSDF YIPKATLKDS 

GSYFCRGLFG SKNVSSETVN ITITQGLAVS TISSFS 
PPGYQVSFCL VMVLLFAVDT^ GLYFSVKTNI 
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Polypeptide 59154 
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Tissue distribution 



Encoded 
by 9 exons 



CD19 is expressed on all human B cells and on B cell precursors before 
CD10 (CALLA antigen) and the intracytoplasmic u. chain. It is not expressed 
on plasma cells. CD19 is also present on follicular dendritic cells 2 . Mouse 
CD 19 mRNA is expressed in B cell lines and spleen *. 

Structure 

The extracellular region of CD19 consists of two C2-set IgSV domains 
separated by a region that contains a pair of Cys residues but has no 
significant sequence homology with any known protein 2 . The large cyto- 
plasmic domain is highly conserved between species 1 and contains several 
potential phosphorylation sites on Ser and Thr and nine Tyr residues 2 . 

Function 

CD19 is involved in the regulation of B cell proliferation. Coligation of 
CD19 with mlg decreases the threshold for rrtfg-dependant stimulation 3 . 
However cross-linking of CD19 molecules independently from mlg inhibits 
the increase in intracellular calcium and proliferation induced by anti-Ig 4 . 
In addition, antibody cross-linking of CD19 alone in B and pre-B cells 
induces an increase in the intracellular calcium levels, without leading to B 
cell activation 4 A CD 19 associates with the complement receptor 2 
(CD21) *> TAPA-1 and Leul3 antigens 7 forming a signal transduction 
complex on the membrane of B cells. 
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Database accession numbers _ 

PIR SWISSPROT EMBL/GENBANK REFERENCE 



Human 



M28170 2 



-1 



Mouse M62542 
Guinea pig M62543 1 

Amino acid sequence of human CD 19 

EVRPEEPLVV KVEEGDNAVL QCLKGTSDGP TQQLTWSRES PLKPFLKLSL 50 

GLPGLGIHMR PLASWLFI FN VSQQMGGFYL CQPGPPSEKA WQPGWTVNVE 100 

GSGELFRWNV SDLGGLGCGL KNRSSEGPSS PSGKLMSPKL YVWAKDRPEI 150 

WEGEPPCVPP ROSLNQSLSQ DLTHAPGSTL WLSCGVPPDS VSRGPLSWTH 200 

VHPKGPKSLL SLEUCDDRPA RDMWVHETGL LLPRATAQOA GKYYCHRGNL 250 

TMSFHLEITA RPVLWHWLLR TGGWKYSAVT UYITFCKS LVGUH10RA 300 

LVLRRKRKRM TDPTRRFFKV TPPPGSGPQN QYGNVLSLPT PTSGLGRAQR 350 

WAAGLGGTAP SY6NPSSDVQ ADGALGSRSP PGVGPEEEEG EGYEEP0SEE 400 

DSEFYENDSN LGQDQLSQDG SGYENPEDEP LGPEDEDSFS NAESYENE0E 450 

ELTQPVARTM DFLSPHGSAW DPSREATSLG SQ$YEDMRGI LYAAPQLHSI 500 

RGQPGPNHEE DADSYENMDN PDGPDPAWGG GGRMGTWSTR 540 
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Molecular weights 
Polypeptide CD22oc 70991 

SDS PAGE 

reduced 130 kD/140 kD 

heterodimer 
unreduced 120 kD/130 kD 

heterodimer 

Carbohydrate 

N-linked sites CD22ct, 10 
CD22p\ 11 

O-linked CD22a, nil 
CD22p, nil 



C2} 



C25 



amm 



iuufum 



oo^ f"^ f*^ cmv^ p,^ CSS 



r*? rvsg p*™? 

ca i cz I ca iTwfcrl 



Tissue distribution 

CD22 is expressed on a subset of Ig+ mature B cells (60-80%) and on hairy 
cell leukaemia cells. It is also present in the cytoplasm of pro-B, pre-B and 
mature B cells. CD22 expression is lost after B cell activation *. 

Structure 

Two forms of CD22 have been described. The a form of CD22 has five IgSF 
domains * The 0 form is identical to the a form except that it has two 
additional IgSF domains (domains three and four) and a cytoplasmic domain 
23 amino acids longer than that of the a form It has not been established 
whether these forms arise by alternative splicing. The sequence of CD22 is 
similar to that of myelin-associated glycoprotein V and both have been 
predicted to have an unusual disulphide bond between p strands B and E in 
the first domain 4 . The cytoplasmic region contains possible protein kinase 
C and casein kinase II phosphorylation sites and six Tyr residues 2 . 



Function | 

CD22a hi 
monocytes 
CD4+ T cei 
B cells ha\ 
CD22- eel 
intracellul* 
linking of 
proliferatio 

Commenl 

CD22 is a 
from two d 

Database a 

t 

Human jf 



Amino ac 

MH L LG P W Lfi 
DSSKWVFH 
KFDGTRLY| 
RMESKTEki 
YPIQLQWtii 
QLQDADGK! 
SNPSVTRY] 
VALNVQYAl 
GRLLGKESi. 
RVSMSPGDi 
EPVKVQHSt 

HSLGCYNPy 
SALHKRQVI 

I 

Amino ac 

Sequence i 



Reference 

Stamen 1 

3 



Wilsonl 
Stamen 
Williarr 
CSHPf 
Pezzuti 
Schwar 




c 



Function 

CD22a has been shown to mediate adhesion of transfected cells to 
monocytes and erythrocytes 1 and CD22[J to mediate adhesion of B cells and 
CD4 + T cells & The CD45RO antigen on T cells and the CD75 antigen on 
B cells have been identified as ligands for CD22B *. CD22+ cells, but not 
CD22- cells, respond to cross-linking with anti-Ig with an increase in 
intracellular calcium and a strong proliferative response s . MAb cross- 
linking of CD22 antigen augments the increase in calcium and the 
proliferation induced by anti-Ig 5 . 



Comments 

CD22 is an heterodimer of 130 and 140 kD glycoproteins that originate 
from two different precursor molecules 6 . 

Database accession numbers ^^t^t^ 
PIR SWISSPROT EMBL/GENBANK REFERENCE 

Human A35648 P20273 \ 1 



Amino acid sequence of human CD22a 



HHLLGPWLLL 
DSSKWVFEHP 
KFDGTRLYES 
RHESKTEKWM 
YPIQLQWLLE 
QLQDADGKFL 
-SNPSVTRYEW 
VALNVQYAPR 
GRLLGKESQL 
RVSMSPGDQV 
EPVKVQHSGA 
ATI ILAICGL 
HSLGCYNPHM 
SALHKRQVGT 



LVLEYLAFS 

ETLYAWEGAC 

TKDGKVPSEQ 

ERIHLNVSER 

GVPMRQAAVT 

SNDTVQLNVK 

KPHGAWEEPS 

DVRVRKIKPL 

NFDSISPEDA 

MEGKSATLTC 

YWCQGTNSVG 

KLQRRWKRTQ 

EDGISYTTLR 

MRTSFQI FQK 



VWIPCTYRAL 
KRVQFLGDKN 
PFPPHIQLPP 
STSLTIKSVF 
HPPKKVTTVI 
LGVLKIQNVG 
SEIHSGNSVS 
GSYSCWVNNS 
ESDANPPVSH 
KGRSPLSTLT 
SQQGLQENSS 
FPEMNIPRTG 
HRGFITQS 



DGDLESFILF 
KNCTLSIHPV 
EIQESQEVTL 
TRSELKFSPQ 
QNPMPIREGO 
WDNTTIACAA 
LQCDFSSSHP 
IGQTASKAWT 
YTWFDNNNGS 
VYYSPETIGR 
GQSFFVRNKK 
DAESSEMQRP 



-1 

HNPEYNKNTS 50 

HLNDSGQLGL 100 

TCLLNFSCYG 150 

WSHHGKIVTC 200 

TVTLSCNYNS 250 

CNSWCSWASP- 300- 

KEVQFFWEKN 350 

LEVLYAPRRL 400 

LPYHSQKLRL 450 

R VAVGLGSCL 500 

VRRAPLSEGP 550 

PPDCD0TVTY 600 
628 



Amino acid sequence of human CD22J5 

Sequence not available in EMBL/GENBANK databases. 
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CD28 



Tp44 



3S 



Molecular weights 

Polypeptide 23085 

SDS PAGE 

reduced 44 kD 

unreduced 90 kD 

Carbohydrate 

N-linked sites 5 



mmn( am 



O-linked 



unknown 



Human gene location and size 
2q33-q34 ; 36 kb 1 



COOH COOH 



Domain 



Exon boundaries 



CKY 



YF9 



TGN 



TM 



KGK 



13 



0 

FWVH 



Tissue distribution 

CD28 is expressed on most T lineage cells and plasma cells 2 . Mature 
thymocytes have higher levels of CD28 than the immature cells and among 
peripheral T cells, 95% of CD4 + cells and 50% of CD8 + cells are positive 2 . 
Activation of T cells leads to enhanced CD28 expression 2 . 

Structure - 

CD28 is a member of the IgSF and is expressed as a disulphide-linked 
homodimer 2 A Human and mouse CD28 are 68% identical at the amino 
acid level 4 . CD28 is particularly similar to CTLA-4 with which it shares a 
ligand and probably a common ancestor in evolution 4 . 

Function 

The ligand for CD28 is B7 s > 6 which is expressed on activated B cells, 
suggesting an important role for CD28 in the interaction between T and B 
cells. Activation of T cells via CD28 has provided evidence for a CD28 
signalling pathway which involves stabilization of cytokine mRNA levels 
and is separate from that used by the TcR-CD3 complex 2 > 7 . 



Amino aci 

MLRLLLALjj 
NKI LVKQSj 
YGNYSQOLC 
PPPYLDNEif 
UT V AFUF 
RS I 



Referenced 

1 Lee, Of 

2 June, cf 

3 AruffoJ 

4 Gross, jj 

5 Linsleyj- 

6 linsleyj 

7 Lindster 

8 Clark, (§ 

a 



Database accession numbers 

PIR SWISSPROT EMBL/GENBANK REFERENCE 
Human P10747 J02988 3 

Mouse M34563 4 

Rat X55288 * 




Amino acid sequence of human CD28 

MLRLLLALNL FPSIQVTG 

NKILVKQSPM LVAYDNAVNL SCKVSYNLFS REFRASLHKG LDSAVEVCVV 

YGNYSQQLQV YSKTGFNCDG KLGNESVTFY LQNLYVNQTD I YFCKIEVMY 

PPPYLDNEKS NGTIIHVKGK HLCPSPLFPG PSKP FWVLVV VGGVLArYSI 

UTVAFUFW VRSKRSRLLH SDYNNMTPRR PGPTRKHYQP YAPPRDFAAY 
RS 
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CD33 



□ 



Molecular weights 
Polypeptide 37908 



SDS PAGE 
reduced 



67 kD 



Carbohydrate 

N-linked sites 5 
O-linked nil 

Human gene location and size 
19 q 13.3; <35 kb * 




COOK 



Domains 



CTF 



HE 



SYF 

E±I 



CSV 



C2 



■™)CYl 



Tissue distribution 

CD33 is found on granulocyte and macrophage precursors in the bone 
marrow, but is not on pluripotent stem cells. The protein is also expressed 
on peripheral monocytes, and is a useful marker for distinguishing 
myelogenous leukaemia cells from lymphoid or erythroid leukaemias z . 

Structure 

The cDNA sequence of CD33 codes for two IgSF domains with closest 
similarity to the NH 2 -terminal domains of myelin-associated glycoprotein 
and CD22 2 . The disulphide bond in domain 1 of CD33 is predicted to be 
between p strands B and E with a Phe residue in the position of the 
conserved Cys of p strand F 3 . There is likely to be a disulphide bond 
between domains 1 and 2. 

Function 

Unknown. 

Comments 

Expressed by myeloid precursors after CD34 f The CD33 gene is located 
near the gene for myelin-associated glycoprotein which has been localized 
to position 19 q 13.1. 



Database accession numbers 

PIR SW1SSPROT EMBL/GENBANK REFERENCE 
Human A30521 P20138 M23197 * 



Amino acid si 

I 

MPLLLLLPLL VI 
DPNFWLGVQE 5 
SGDSPVATNK i 
FRMERGSTKY < 
ACEQGTPPIF i 
GAGVTTERTI j 
TAU.ALCICL 
GPTETSSCSG / 



References \ 
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Quantitatiy 
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CD3 



I 



Amino acid sequence of human CD33 



MPLLLLLPLL 
DPNFWLQVQE 
SGDSPVATNK 
FRMERGSTKY 
ACEQGTPPIF 
6AGVTTERTI 
TALLALCLCL 



WAGALAM 

SVTVQEGLCV LVPCTFFHP1 
LDQEVQEETQ GRFRLLGDPS 
SYKSPQLSVH VTDLTHRPKI 
SWLSAAPTSL GPRTTHSSVL 
QLNVTYVPQN PTTGI FPGDG 
I FFI VKTHRR KAARTAVGRN 



GPTETSSCSG AAPTVEMDEE LHYASLNFHG 



PYYDKNSPVH 
RNNCSLSIVD 
UPGTLEPGH 
I ITPRPQDHG 
SGKQETRAGV 
DTHPTTGSAS 
MNPSKDTSTE 



-1 

GYWFREGAI I 50 

ARRRDNGSYF 100 

SKNLTCSVSW 150 

TNLTCQVKFA 200 

VH GAI6GAGV 250 

PKHQKKSKLH 300 

YSEVRTQ 347 
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CD38 



TIO 



Molecular weights 
Polypeptide 34302 



SDS PAGE 
reduced 
unreduced 



45 kD 
45 kD 



Carbohydrate 

N-linked sites 4 
O-linked unknown 

Human gene location 
4 



m?>mt 



NH 2 



Tissue distribution 

CD38 is found on early cells of the B and T cell lineages, on activated B and 
T cells, but not on most mature resting peripheral lymphocytes. It is present 
on thymocytes, pre-B cells, germinal centre B cells, mitogen-activated T 
cells, and Ig-secreting plasma cells 

Structure 

CD38 is a type D membrane glycoprotein, with the transmembrane 
sequence near the NH 2 - terminus 1 . 

Function 

Unknown, but anti-CD38 mAbs rj^cgmbinatipn. with accessory cells. are 
fnitbgenic for T cells and a variety of cell lines 3 . 

Database accession numbers 

PIR SWISSPROT EMBL/GENBANK REFERENCE 
Human M34461 1 



Amino acid sequence of human CD38 



MANCEFSPVS 
SGPGTTKRFP 
ITEEDYQPLM 
GYLADDLTWC 
CDVVHVMLNG 
RDLCQDPTIK 



GDKPCCRLSR 
ETVLARCVKY 
KLGTQTVPCN 
GEFNTSKI NY 
SRSKIFDKNS 
ELESI ISKRN 



R AQLCLGV5I LVLILVVVLA VVVP RWRQTW 50 



TEIHPEMRHV 
KILLWSRIKD 
QSCPDWRKDC 
TFGSVEVHNL 
IQFSCKNIYR 



OCQSVWDAFK 
LAHQFTQVQR 
SNNPVSVFWK 
QPEKVQTLEA 
PDKFLQCVKN 



GAFISKHPCN 100 

DMFTLEDTLL 150 

TVSRRFAEAA 200 

WVIHGGREDS 250 

PEDSSCTSEI 300 



References 
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Molecular weights 

Polypeptide 24854 



SDS PAGE 
reduced 



40-47 kD 



Carbohydrate 
N-linked sites 6 
O-linked unknown 

Human gene location 
Iq21-q23 




mutm 



Domains j SE YIM . ? V ' YTC 

IsTv 1 C2 ~[G] 



Tissue distribution 

Expressed on all peripheral blood T, B and null cells as well as on 
thymocytes, 30% of bone marrow cells, eosinophils (stronger than on 
neutrophils) and epithelium of bronchus and salivary glands, but not on 
erythrocytes V. In the rat the MRC OX-45 antigen is expressed on all 
haematopoietic, cells including erythrocytes and on endothelium A All 
mouse thymocytes and lymph node cells as well as most macrophages and 
bone marrow cells express CD48, but erythrocytes do not express detectable 
amounts * 



Structure 

CD48 consists of an Ig V-like domain which lacks a disulphide bond and a 
Ig C2set domain which is linked to the lipid bilayer by a GPI anchor S A 
The site of GPI attachment is shown as Ser 194 on the basis of homology 
with the rat sequence 7 . The sequence of CD48 matches particularly well 
with that of CD58 (LFA-3) and CD2 V. 



i*| Function 

In the rat system, anti-CD48 mAbs inhibit the MLR by activation of non- 
specific suppression by accessory cells, probably macrophages * A recent 
study showed that GPI -anchored molecules of human and mouse T 
lymphocytes can associate with tyrosine kinases (p56 Ick in the case of 
human CD48) 9 . This interaction may be the mechanism for signal 
transduction by GPI -anchored molecules. 



Comment 

CD2 and CD58 (LFA-3) are linked to an ATPase gene in man and mouse. 
A similar linkage for the CD48 gene to another ATPase gene suggests 
that these linkage groups arose by duplication of a large chromosomal 
fragment 4 . 




Database accession numbers 

PIR SWISSPROT EMBL/GENBANK REFERENCE 
Human A27236 P09326 X06341,M37766, 6 

M59904 

Rat S01299 P10252 X13016 7 

Mouse JL0143 P18181 X17501 * 



Amino acid sequence of human CD48 

MWSRGWDSCL ALELLLLPLS LLVTSI -1 

QGHLVHHTVV SGSNVTLNIS ESLPENYKQL TWFYTFDQKI VEVIDSRKSKY 50 

FESKFKGRVR LDPQSGALY1 SKVQKEDNST YIMRVLKKTG NEQEWKIKLQ 100 

VLDPVPKPVI KIEKIEOHDD NCYLKLSCVI PGESVNYTWY GDKRPFPKEL 150 

QNSVLETTLH PHNYSRCYTC QVSNSVSSKN GTVCLSPPCT LARS 194 

FGVEWIASHL VVTVPTILGL LLT 217 
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Molecular weights 

Polypeptide 55402 

SDS PAGE 

reduced 85-110 kD 

Carbohydrate 

N-linked sites 8 
O-linked unknown 

Human gene location and size 
19; 12 kb 1 * 




7TTT > TT 



COOH 



Oomains 



CST 



HE 



PMC 



CQV 



C2 



FSC 



CSL 



LTC 



CEA 

l FS 9 



CQA 



YLC 



C2 



C2 



C2 



| C2 itmicy! 



Tissue distribution 

CD54 has a wide tissue distribution and is expressed on both haematopoietic 
and nonhaematopoietic cells J A Expression on resting peripheral' blood 
leucocytes is weak but is upregulated on activated T and B lymphocytes 
and monocytes CD54 expression on endothelium and other 
nonhaematopoietic cells is upregulated by inflammatory mediators 



Structure 

The extracellular domain is made up of five IgSF domains *> z . Within the 
IgSF, CD54 is most similar to ICAM-2 which has two IgSF domains that are 
closely related to CD54 domains 1 and 2 2 A 



Function 



CD54 mediates cell adhesion by binding to the integrins CDlla/CD18 
(LFA-1) and to CDllb/CD18 (Mac-l)f This adhesion is not due to 
interaction with an epitope containing Arg-Gly-Asp (RGD) because this 
sequence is absent from CD54 1 > 3 . The first NH 2 -terminal domain of CD54 
binds to CDlla/CD18 (LFA-1 J 3 and the third to CDllb/CD18 f The first 
domain of CD54 also binds to rhinovirus 3 . The interaction of CD54 with 
CD1 la/CD 18 enhances antigen-specific T cell activation 5 . CD54 has also 
been reported to bind to CD43 6 and to Plasmodium falciparum- infected 
erythrocytes 9 > w . 



Database accession numbers 

PIR SWISSPROT EMBL/GENBANK REFERENCE 
Human S00573 P05362 x X06990 

Mouse S06015 P13597 ' X52264 



Amino acid sequence of human CD54 



MAPSSPRPAL 
NAQTSVSPSK 
RKVYELSNVG 
PVGKNLTLRC 
RRDHHGANFS 
VLEVDTQGTV 
VSVTAEDEGT 
TEVTVKCEAH 
EVAGQLIHKN 
LPELKCLKDG 
SPRYE I V HT 
NTQATPP 



PALLVLLGAL 
VILPRGGSVL 
EDSQPMCYSN 
QVEGGAPRAN 
CRTELDLRPQ 
VCSLDGLFPV 
QRLTCAVILG 
PRAKVTLNGV 
QTRELRVLYG 
TFPLPIGESV 
VVAAAVIMGT 



FPGPG 

VTCSTSCDQP 
CPDGQSTAKT 
LTVVLLRGEK 
GLELFENTSA 
SEAQVHLALG 
NQSQETLQTV 
PAQPLGPRAQ 
PRLDERDCPG 
TVTROLEGTY 
AfiUIIIJNR 



KLLGI ETPLP 

FLTVYWTPER 

ELKREPAVGE 

PYQLQTFVLP 

D.ORLHPJVTY- 

TIYSFPAPNV 

LLLKATPEDN 

NWTHPENSQQ 

LCRARSTQGE 

QRKIKKYRLQ 



7 




8 






-1 


KKELLLPGNN 


50 


VELAPLPSWQ 


100 


PAEVTTTVLV 


150 


ATPPQLVSPR 


200 


GNDSFSAKAS- 


250- 


ILTKPEVSEG 


300 


GRSFSCSATL 


350 


TPMCQAWGNP 


400 


VTREVTVNVL 


450 


QAQKGTPMKP 


500 




507 
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CD58 



LFA-3 



Molecular weights 

Polypeptide: transmembrane 25339 

SDS PAGE 

reduced 55-70 kD 
unreduced 55-70 kD 

Carbohydrate 

N-linked sites 6 

O- linked unknown 

Human gene location 
lpl3* 



OR 



Aim 5 A ii 



COOH 



Domains 



FHV 



YEM 



CMI 



IOC 



HE 



C2 



|TM | CYf 



Tissue distribution 

CDS 8 has a wide tissue distribution on haematopoietic and various 
nonhaematopoietic cells including endothelium , 2 Af It is strongly expressed 
on macrophages and is present on germinal centre B cells 4 , medullary 
thymocytes 4 and memory T lymphocytes 3 . Expression of CD58 on human 
cortical thymic epithelium is disputed 4 . The sheep CD58 homologue is 
expressed on erythrocytes and mediates the phenomenon of E resetting, i.e. 
the rosettes formed spontaneously between sheep erythrocytes and human 
T cells. 



Structure 

The extracellular domain is similar to its ligand CD2 and also to CD48 5 . It 
is made up of two IgSF domains with the NH 2 -terminal one lacking the 
normally conserved disulphide bond *&3,5 m Alternative splicing gives rise to 
a transmembrane form and a glycosyl-phosphatidylinositol anchored form 
2 A The NH 2 - terinmus °f me mature polypeptide has been established by 
protein sequencing 8 . 



Function 

CD58 mediates cell adhesion through the binding of the extracellullar 
domain to the NH2-terminal domain of CD2 I A 3 . Differences in function 
between the transmembrane and glycolipid-anchored form are unknown 2 A 
The affinity of CD58 for CD2 is in the order of 10 6 M Binding of CD58 
to CD2 can enhance antigen-specific T cell activation J A A role for CDS 8 in 
signal transduction is suggested by the CD58-stimulated release of IL1 from 
thymic epithelial cells 6 or monocytes 7 . 



Comments 

The genes for CD58 and CD2 are linked in the genome 1 . 



Database accession numbers 

PIR SWISSPROT EMBL/GENBANK REFERENCE 

Human A28564 P19256 Y00636 * 

Human (GPI) S01269 X06296 9 



Amino acid sequence of human CD58 

MLRLLLALNL FPSIQVTGNK ILVKQSPM -1 

FSQQIYGVVY GNVTFHVPSN VPLKEVLWKK QKDKVAELEN SEFRAFSSFK 50 

HRVYLOTVSG SLTIYNLTSS DEDEYEMESP NITDTMKFFL YVLESLPSPT 100 

LTCALTNGSI EVQCMIPEHY NSHRGLIMYS WDCPMEQCKR NSTSIYFKME 150 

NDLPQKIQCT LSNPLFNTTS SI I LTTC I PS % SGHSRHRYAL_!_PJ2LAVI1I 200 

CIVLYMMGIL KCDRKPDRTN SN 222 

GPI-anchored form |Ser i80 is a potential site for GPI attachment) 

SGHSRHRYAL IPIPLAVITT CI VLYMNVL 209 
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FcyRI 



Hi 

hi i ■ 



Molecular weights 
Polypeptide 40833 



SDS PAGE 
reduced 



72 kD 



Carbohydrate 

N-linked sites 7 
O-linked unknown 

Human gene location and size 
lqj 9.4 kb 1 




COOH 



CEV 

Domains | YRC 
/VDT 



CHA 



YHC 



CET 



C2 



RGW 



| YWC 



C2 



WVP 



h 

KEL 



|TM|CY| 



LGL 



Tissue distribution 

- Human CD64 is expressed on macrophages, monocytes and interferon y 
induced neutrophils. Interferon y treatment strongly upregulates expression 
on monocytes. Mouse CD64 is expressed on macrophages 

Structure 

The extracellular region consists of three C2-set IgSF domains. The first two 
domains are about 50% identical to the extracellular domains of CDw32 
(FcyRH) and CD16 (FcTRmj and altogether the Fc receptors form a closely 
related subgroup within the IgSF 4 The third and less closely related domain 
has been proposed to be responsible for the high affinity binding of CD64 to 
IgG 4f Two alleles of human CD64 have been described 2 . A third form 
with a different cytoplasmic sequence may be the result of a cloning artifact 



Function 

CD64 is a high affinity receptor for IgG and the only IgG receptor which 
binds to monomeric ligand 2 . The FcyRl binding site in IgGi has been 
mapped to the CH2 domain « FcyRI plays a central role in antibody- 
dependent cytotoxicity and clearance of immune complexes 3 . 
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Database accession numbers 

PIR SWISSPROT EMBL/GENBANK REFERENCE 

Human S03018 P12314 X14356 « 

S03019 P12315 X14355 4 

Mouse M31314 * 



Amino acid sequence of human CD64 

SWISSPROT: P12314 

MWFLTTLLLW VPVOG -1 

T* V* 

QVOTTKAVIS LQPPWVSVFQ EETVTLHCEV LHLP6SSSTQ WFLNGTATQT 50 

STPSYRITSA SVNDSGEYRC QRGLSGRSDP IQLEIHRGWL LLQVSSRVFT 100 

EGEPLALRCH AWKDKLVYNV LYYRNGKAFK FFHNNSNLTI LKTNISHNGT 150 

YHCSGMGKHR YTSAGISVTV KELFPAPVLN ASVTSPLLEG NLVTLSCETK 200 

LLLQRPGLQL YFSFYMGSKT LRGRNTSSEY . QI LTARREDS GLYWCEAATE 250 

DGNVLKRSPE LELQVLGLQL PTPVWFH VLF 'YLAVGIMFLV NTVLWVTI RK 300 

ELKRKKKWDL EISLDSGHEK KVTSSLQEDR HLEEELKCQE QKEEQLQEGV 350 

HRKEPQGAT 359 
* Allelic residues in human CD64. 

SWISSPROT: P 123 15. Possible cloning artifact. 

MWFLTTLLLW VPVDG -1 

QVDTTKAVIT LQPPWVSVFQ EETVTLHCEV LHLPGSSSTQ WFLNGTATQT 50 

STPSYRITSA SVNDSGEYRC QRGLSGRSDP IQLEIHRGWL LLQVSSRVFT 100 

EGEPLALRCH_ AWKDKLVYNV LYYRNGKAFK . FFHWNSNLTI LKTNISHNGT -150 

YHCSGMGKHR YTSAGISVTV KELFPAPVLN ASVTSPLLEG NLVTLSCETK 200 

LLLQRPGLQL YFSFYMGSKT LRGRNTSSEY QILTARREDS GLYWCEAATE 250 

DGNVLKRSPE LELQVLGLQL PTPVUFH VLF YLAVGIMFLV NTVLWVTI RK 300 

ELKRKKKWDL EISLDSGGQA LEAPTQGCA 329 
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